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1 Introduction 



1.1 Sequence comparison setting 

Throughout this paper X = (Xi, X2, ■ ■ ■ X n ) and Y = (Yi, Y2, ■ ■ -Y n ) are two random 
strings, usually referred as sequences, so that every random variable Xi and Yi take val- 
ues on a finite alphabet A. We shall assume that the sequences X and Y have the same 
distribution and are independent. The sample space of X and Y will be denoted by X n . 
Clearly X n C A" but, depending on the model, the inclusion can be strict. 
The problem of measuring the similarity of X and Y is central in many areas of appli- 
cations including computational molecular biology [HJ [T5J [Ml ESI HI] and computational 
linguistics (I2J [27J [311 [32] . In this paper, we adopt the same notation as in [25J , namely 
we consider a general scoring scheme, where S : A x A — > M + is a pairwise scoring func- 
tion that assigns a score to each couple of letters from A. An alignment is a pair (p, r) 
where p = (pi, p 2 , . . . , p k ) and p = (r 1; r 2 , . . . , r k ) are two increasing sequences of natural 
numbers, i.e. 1 < p\ < p 2 < ... < p k < n and 1 < T\ < r 2 < . . . < r k < n. The integer k 
is the number of aligned letters, n — k is the number of gaps in the alignment. Note that 
our definition of gap slightly differs from the one that is commonly used in the sequence 
alignment literature, where a gap consists of maximal number of consecutive indels (in- 
sertion and deletion) in one side. Our gap actually corresponds to a pair of indels, one in 
X-side and another in F-side. Since we consider the sequences of equal length, to every 
indel in X-side corresponds an indel in y-side, so considering them pairwise is justified. 
In other words, the number of gaps in our sense is the number of indels in one sequence. 
We also consider a gap price 5. Given the pairwise scoring function S and the gap price 
5, the score of the alignment (tt, p) when aligning X and Y is defined by 



In our general scoring scheme S can also be positive, although usually 5 < penalizing 
the mismatch. For negative 5, the quantity —5 is usually called the gap penalty. The 
optimal alignment score of X and Y is defined to be 

L n := L(X,Y) := maxU {p>T) (X,Y), (1.1) 

where the maximum above is taken over all possible alignments. To simplify the notation, 
in what follows, we shall denote Z := (X, Y) so that L n = L(Z). 

When 5 = and the scoring function assigns one to every pair of similar letters and 
zero to all other pairs, i.e. 



then L(Z) is just the maximal number of aligned letters - the length of the longest common 
subsequence (abbreviated by LCS). The longest common subsequence is probably the most 
common measure of global similarity between strings. 



fV r) (X, Y) := S(X pi , Y n ) + S(n - k). 
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1.2 The variance problem: history and the state of art 



Since X, Y are random string, the optimal score L n is a random variable. In order 
to distinguish related pairs of strings from unrelated ones, it is relevant to study the 
distribution of L n for independent sequences. When X and Y are take from an ergodic 
processes then, by Kingman's subbaditive ergodic theorem, there exists a constant 7 such 
that 

— — >■ 7 a.s. and in L\, as n — > 00. (1.3) 

n 

In the case of LCS, namely when S is taken as in ( II. 2p . the constant 7 is sometimes 
called the Chvatal-Sankoff constant and its value, although well estimated (see [U |5l 
1361 [T3| IT2| |33| l28j |23| [191 122] ) remains unknown even for as simple cases as i.i.d. Bernoulli 
sequences. The existence of 7 was first noticed by Chvatal and Sankoff in their pioneering 
paper [TO], where they proved that the limit 

7 ;= lim Ehi (1.4) 

exists. In [1] the rate of the convergence in (jl.4p was for the first time established, and 
in [25] the authors improved the previous results introducing a new technique based on 
entropy and combinatorics, which gives a little more bout the path structure of the optimal 
alignments. 



The fluctuations of L n . To make inferences on L n , besides the convergence (11.31) . the 

size of the variance Var[L n ] is essential. Unfortunately, not much is known about Var[L n ] 
and its asymptotic order is one of the central open problems in string matching theory. 
Monte-Carlo simulations lead Chvatal and Sankoff in [10J to conjecture that, in the case 
of LCS, Var[L n ] = o(ns) for i.i.d. —Bernoulli sequences. Using an Efron-Stein type of 
inequality, Steele [36] proved that there exist a constant B < 00 such that Var[L„] < Bn. 
In [38], and always in the LCS case, Waterman asks whether this linear bound can be 
improved. His simulations show that this is not the case and Var[L n ] should grow linearly. 
Still in the LCS case, Boutet de Monvel [7] interprets his simulations the same way. In a 
series of papers containing different settings, Matzinger et al. have been investigating the 
asymptotic behavior of Var[L n ]. Their goal is to find out whether there exists a constant 
b > (not depending on n) such that Var[L n ] > bn. Together with Steele's bound, this 
means that bn < Var[L n ] < Bn, i.e. Var[L n ] = 0(n) (we say that a sequence a n is of 
order 0(n) if, there exist constants < b < B < 00 such that bn < a n < Bn for all n 
large enough). So far, most of the research to show that Var[L n ] = 0(n) has been done 
in the case of LCS: 

• In [8J, X is a ^-Bernoulli binary sequence and Y is a non-random periodic binary 
sequence, 

• In [6], X is a |-Bernoulli binary sequence and Y is an i.i.d. random sequence over a 
3 - symbols alphabet, 
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• In [20], both X and Y are ^-Bernoulli binary sequences but they are aligned by using 
a score function which gives more weight when aligning ones than aligning zeros, 

• In [24J, both X and Y are i.i.d. binary sequences, but one symbol has much smaller 
probability than the other. That is a so called case of low entropy. 

• In [37J, both X and Y are binary sequence having a multinomial block structure. 
That is, for the first time, a so called case of high entropy. 

Another related string matching problem is the so called longest increasing subsequence 
(abbreviated by LIS) problem. Given a sequence X, to find the LIS of X is to find 
an increasing sequence of natural numbers 1 < %\ < 12 < • ■ • < %k < n such that 
< X i2 < • • • < X ik . The LIS problem can be seen as a particular case of the LCS 
problem, in the following way: Let X :— 1 2 • • • n be the sequence of the first n increasing 
integers and let cr(X) := a(l)a(2) ■ ■ ■ a(n) be the sequence of its random permutation. 
Then, a LIS of X corresponds to a LCS of X and o~(X). Due to this equivalence, it 
was thought that the LIS and the LCS have fluctuations of the same order, which now 
we know it is not true. In this direction Houdre, Lember and Matzinger [21] studied an 
hydrid problem, namely the fluctuations of i n defined as the length of the longest common 
increasing subsequence of X and Y, where X and Y are i.i.d. ^-Bernoulli binary sequences, 
and a longest common increasing subsequence of X and Y is just a LIS of X and of Y 
simultaneously. They showed that n -1 / 2 (£ n — E£ n ) converges in law to a functional of 
two Brownian motions, which implies that Var[£ n ] = 0(n) holds as well. There is also a 
connection between the LCS of two random sequences and a certain passage percolation 
problem [I]. 

For the case of general scoring, to our best knowlodge, the only previous partial results 
on fluctuations are cointained in [26] . 

1.3 Main results and the organization of the paper 

Recall that we aim to prove (or disprove) the order of the variance Var[L n ] = G(n) and 
due to Steele's upper bound it suffices to prove (or disprove) the existence of b > so 
that Var[L n ] > bn. All available proofs (by Matzinger et al.) of the existence of such b 
follow more or less the same philosophy and can be split into two parts, strategy that 
we call two-step approach. The first part of this approach is to find a random mapping 
independent of Z, usually called a random transformation, 

1Z : X n x X n — > X n x X n 

such that, for most of the outcomes z G X n x X n of Z, increases the score at least by 
some fixed amount e D > 0. More precisely, the random transformation should be that for 
some a > there exists a set B n C X n x X n having probability at least 1 — exp[— n a ] so 
that, for every z G B n , the expected score of TZ(z) exceeds the score of z by e D (where the 
expectation is taken over the randomness involved in the transformation), namely 

E[L{Tl{z))\ >L(z)+e . 
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Before stating this requirement formally, let us introduce a useful notation: Z := 7Z(Z). 
Thus Z is obtained from Z by applying a random modification to Z and the additional 
randomness is independent of Z. Formally, the first step of the approach is to find a 
random transformation so that for some universal constants a > and e a > 0, the 
following inequality holds: 

P(E[L(Z) - L(Z)\Z] > e ) > 1 - exp[-n a }. (1.5) 

Besides f ll.5p . the random transformation has to satisfy some other requirements. This 
other requirements and their influence on the fluctuations of L n form the second step of 
the two-step approach. Roughly speaking, there should also exist an associated function 
of Z, let us call it u(Z), so that applying the random transformation to Z increases the 
value of u. The variance of L n can be then lower bounded by the variance of U := u(Z) so 
that the constant b exists if the variance of u(Z) is linear on n. This second step is formally 
presented and explained with details in Subsection I2.3[ where we also briefly introduce 
the random transformation and the random variable U used so far. Let us remark that in 
earlier articles of the subject jH |20j |23J [37], the random transformation is not explicitly 
defined, but the variance driving random variable U is always there, and one can easily 
define the random transformation as well. Let us also mention that to show ( 11.51) for a 
suitable chosen transformation is not an easy task and, typically, needs a lot of effort. The 
second step of the approach consists of showing that ( 11.51) implies the existence of b. This 
proof depends on the model, on the chosen transformation and its components (vectors U 
and V, see Subsection 12. 3j) . One of the goals of the present paper is to present a general 
setup and a general proof for the second step. With such a general proof in hand, all the 
the future proofs of the existence of b could be remarkably shortened and simplified. Our 
general approach is based on the same strategy as in [2H E7J , but remarkably shorter and 
simpler (see also Remark 6 before the proof of Theorem \2.2\i . These general results are 
Theorem 12.11 and Theorem 12. 2[ both presented in Section [2j 

In order to see in action our two-step approach, we include two applications which bring 
us up two new fluctuation results: 

First application: optimal score of i.i.d. sequences. In Section [31 X and Y are A- 
valued i.i.d. random variables, being compared under the general scoring scheme described 
in Subsection ll.il In this case, the random transformation consists of uniformly choosing 
a specific letter a e A and turning it into another specific letter b G A. In [26], it 
has been proven that when the gap price is relatively low and the scoring function S 
satisfies some mild asymmetry assumptions, then the described transformation satisfies 
(II. 5p . see Theorem 13.11 and the remarks after it. Thus, for sufficiently low gap price 6, 
the first step holds true. In Section [21 we show that all other assumptions of Theorem 
12.11 and Theorem 12.21 are fulfilled, so that the second step holds true and thus there exists 
the desired constant b (Theorem 13. 2j) . Hence, Section [3] completes the study started in 
[26] and, to our best knowledge, we obtain the first result where the order of variance 
Var[L n ] = 0(n) is proven in a setup other than LCS of binary sequences. It is important 
to note that for the second step (Theorem 13.21) . no assumption on 5 nor on the scoring 



5 



function S are needed. Hence, whenever the assumptions in the first step can be relaxed 
(i.e. generalization of Theorem I3.ip . the second step still holds true and the order of 
variance Var[L n ] = 0(n) can be automatically deduced. 

Second application: The length of the LCS of random i.i.d. block sequences. 

Unfortunately, the current assumption on the gap price 5 makes Theorem 13.11 not appli- 
cable to the length of the LCS of two independent i.i.d. sequences, thus this case (except 
the special model in [21]) is still open. In order to approach this still open question from 
another point of view, in Section H] we consider X and Y not to be i.i.d. sequences any 
more, but we keep A = {0, 1} consisting of two colours (i.e. the sequences are still binary) 
and the scoring function to be (11 .2D . also 5 = 0. Hence, L n is the length of the LCS. The 
difference from the setup considered in Section [3] lies in the random structure of X and 
Y . Let us briefly explain the model. Note that any binary sequence can be considered as 
a concatenation of blocks with switching colors (from to 1 or viceversa). Here a block is 
merely a subword of the sequence having all letters of the same color and a different color 
before and after it. Hence, every binary sequence is fully determined by the lengths of its 
blocks and the color of its first block. Therefore, every infinite i.i.d. Bernoulli sequence 
X with parameter ~ can be considered as an i.i.d sequence of blocks whose lengths are 
geometrically distributed, where the first block has colour either or 1 with probability 
|. X can be, in a sense, approximated by a (binary) random sequence X with finite range 
of possible block lengths. Indeed, the probability of finding a very long block in X is very 
small, hence such an approximation of X by X is justified (note that although the blocks 
remain to be i.i.d, X is not an i.i.d. sequence any more). This is the situation in Section 
HJ instead of considering X (and Y) as the first n elements of an i.i.d. infinite Bernoulli 
sequence with parameter |, we take them as the first n elements of an infinite sequence 
Xi, X 2 , . . . obtained by i.i.d. concatenating blocks of alternated colours of random lengths 
distributed on {I — 1, 1, 1 + 1}, where I > 2 is a fixed integer. For a formal description of 
this block model see Subsection 14.11 The restriction that the block lengths can only have 
three possible values is made in order to have a simplified exposition of the technique. 
We believe that the results in Section H] also hold for any finite range of possible block 
lengths. 

Considering such a block model is motivated by the following arguments. First, it is a 
common practice in random sequence comparison to approximate a target model (i.i.d. 
Bernoulli sequences, in our case) by some more tractable model. In random sequence com- 
parison, the more tractable model typically has lower entropy. Secondly, as it is shown in 
[37] , for the case where all three possible block lengths have equal probability, there exists 
a random transformation so that (jl.5p holds, see Lemma |4"1j1 The random transformation 
in this case - let us call it the block-transformation - is the following: pick uniformly an 
arbitrary block of X with length / — 1 and independently an arbitrary block of X with 
length 1 + 1. Then, change them both into blocks of length I. Thus, in this particular case 
where all block lengths have equal probability, the first-step is accomplished. In Section 
HJ we show that the block-transformation and corresponding random variables satisfy all 
other assumptions of Theorem 12.11 and Theorem 12. 2\ so that the existence of b follows 
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(Theorem 14. ip . Thus, for the case of equiprobable block lengths, the order of variance 
Var[L n ] = 0(n) has now been proved. Since the uniform distribution of of block lengths 
was not used in the second step (see Theorem 14. ip . it follows that the same order of vari- 
ance automatically holds if an equivalent to Lemma 14.61 without the uniform distribution 
assumption can be shown. Again, we believe that such a generalization is true. 

2 The two-step approach 
2.1 Preliminaries 

Proposition 2.1 Let N be an integer-valued random variable taking values on interval 
I. Let f : I — > M. be a monotone function so that for a c > 0, f(k) — f(k — 1) > c (or 
f(k - 1) - f(k) >c) for every k,k-lel. Then Var[/(iV)] > c 2 Var[iV]. 

For a proof of this statement see [6]. The next corollary replaces the more involved 
Lemma 3.3 in j6] or Lemma 5.0.3 in [37]. In our general approach, we need a simpler 
version because we use the decomposition (12.91) (see Remark 6. after Theorem 12. ip : 

Corollary 2.1 Let N be an integer-valued random variable taking values on the set Z : = 
{zi, z 2 , . . .} C Z, . Let k Q : — supj >2 (-Zi — i) < oo. Let f be an increasing function 
defined on Z so that for 5 > it holds 

f{zi)-f{z^ x )>5, Vz>2. 

Then 

5 2 

Var[/(A0] > T^VarLV]. 
% 

Proof. Let M be a random variable taking values on the set / = {1,2,...} defined as 
follows: M — % iff N = Zi- Let g be an increasing function on / defined as follows: 
g (i) ■= f( Zi ). Thus g(i + 1) - g(i) > 5 and g(M) = f(N). By Proposition O 

Var[/(iV)] = Vax\g{M)] > 5 2 Var[M] > (— ) 2 Var[AT], 

Kq 

where the last inequality follows from the inequality Var[iV] < fc 2 Var[M]. 



Lemma 2.1 (Chebychev's inequality) Let U be a random variable, then for any con- 
stant ( > we have 

P(\U- E[U}\ > CVvmU] ) < i- (2-1) 
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Lemma 2.2 (Hoffding's inequality) Let a > be constant and V\, V2, ■ ■ ■ be an i.i.d 

sequence of bounded random variables such that: 

P(\V l -E[V i \\<a) = l 
for every % = 1, 2, . . . Then for every A > 0, we have that: 

Vi H VV n 



> A) <2exp(-^-n) (2.2) 



n 

The following lemma follows from the local central limit theorem (section 2.5 in [18J ) : 

Lemma 2.3 Let X ~ B(m,p) be a binomial random variable with parameters m and p. 
Then, for any constant (3 > 0, there exists b(/3) and mo(/3) so that for every m > m and 

i E [mp — 0y/m, mp + /3y/m\ =: J m , 

it holds 

P(X = i) = ( m X\l-p) m - 1 > (2.3) 
V % I b\/m 



Moreover, there exists an universal constant ci(/3) > and mi(/3) so that for every m > 
mi 

Var[X|X G J m ] > c x m. (2.4) 

Applying Lemma 12.31 repeatedly on marginals, we obtain a multinomial corollary: 

Corollary 2.2 Let (X, Y, Z) be a multinomial random vector with parameters m and p\, 
Pi, P3 such that pi + P2 + P3 = 1- Then, for any constant (3 > 0, there exists b(f3) and 
m ((3) so that for every m > m and 



(i,j) G [mp 2 — (3y/m, mp 2 + P\frn\ x [mpi — (3y/m, mpi + (3y/m\, 

it holds 

P{X = i,Y = j)=(. m . .XW; ' ' > (2-5) 
\i j [m — 1 — j) J bm 

2.2 General fluctuations results 

Let X n be the sample space of X and Y so that X n x X n is the sample space of Z := (X, Y). 
In the following, we are considering the functions 

u : X n x X n — > Z, t) : X n x X n — > 1j d 

so that U := u(Z) (resp. V := X>(Z) ) is an integer values random variable (resp. vector). 
We shall denote by S n , S% an d <SX the support of (U, V), U and V, respectively. Hence 
<S n C Z d+1 , C Z and C 1> d . For every v £ , we define the fiber of S% as follows 

S n (v) := {ueS% : (u,v) eS n }. 
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For any (u,v) G S n , let 

l(u,v) := E[L{Z)\U = u,V = v}. 

In what follows, we shall often consider the conditional distribution of U given that V 
takes a particular value v. Therefore, we shall denote by a random variable that has 
this conditional distribution, i.e. for any z G Z, it holds 

P(U {V) =z) = P(U = z\V = v). 

We shall also consider the sets of "typical values" of V and U( v y More precisely, we shall 
define the sets V n C S% that contain (in some sense) the values of V that are of our 
interest. Similarly, for every v G V n , we shall define the sets U n {v) that (again in some 
sense) contains the values of U( v ) that are of our interest. Roughly speaking, in what 
follows we shall always condition on the events {V G V„} and {£/(„) G U n (v)}. 

Theorem 2.1 Assume the existence of sets V n C S% and U n {v) C S n (v), for v G V n , so 
that for some constants 5 > and k Q G N, the following conditions hold: 

1) For every v G V n and U\,U2 G U n {v) such that u\ < U2, it holds 

l(u 2 ,v)-l(u 1 ,v)>5. (2.6) 

2) There exists ip n so that for every v G V n , the following lower bound holds 

Var [[/(„) | [/(„) G U n (v)\ > ip n . (2.7) 

3) There exists k Q > oo so that for every v G V n and Ui G U n (v), there exists an u 2 G 

U n {v) so that \u\ — u 2 \ < ui + k Q . 

Then 2 

Var[L(Z)] > ( ■ ^ • V P(C/ M G Z4(u))P(y = u). (2.8) 

Proof. It is clear that 

Var[L(Z)] = E(Vax[L(Z)\U, V]) +Vai(E[L(Z)\U,V]) > Vai(l(U,V)). (2.9) 

We aim to bound Var(/(£7, V)) from below. We condition on V and use the same formula 
to get 

Vai(l(U,V)) = E(Vai[l(U,V)\V\) +Vai(E[l(U,V)\V\) > E(Vai[l(U,V)\V\) 

= Vm[1(U,v)\V = v]P(V = v) > yai[l(U,v)\V = v]P(V = v) 

vesX vev„ 

= Y / V^iKU {v) ,v)]P(V = v). (2.10) 

i>ev„ 
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Conditioning on the event {Um 6 W„}, we see that 



Vax[l(U {v) ,v)] > Vax[l(U {v) ,v)\U (v) G U n (v)]P(U {v) e U n ). 

By assumption 1), on the set U n {v) the function I satisfies (I2.6p . By assumption 3), the 
two consecutive atoms of S n (v) C\U n are at most k Q apart from each other. By Corollary 
EU thus 

5 2 

Vai[l(U (vh v)\U (v) E U n ] > — ■ Vax[U iv) \U iv) E U n \. (2.11) 
Thus (12.101) can lower bounded by 

Vai[l{U (v) ,v)]P(V = v)> (P\ ■ Var [U (v) \U (v) e U n (v)]P(U {v) e U n (v))P(V = x 



> / \ 



d " Vn- Y,P(U (!u) eU n (v))P(y = v). 

v&V n 



Remarks: 

1. The theorem above is non-asymptotic. It means that n is fixed and, therefore, could 
be removed from the statement. However, writing the theorem in such a way, we try 
to stress out that S and k should be independent of n when applying the theorem. 
Obviously X, Y, Z, U, V will depend on n too, but we do not explicitely include that 
in the notation. 

2. In order to get a linear lower bound from (12.81) . it suffices to show that for some 
constant b > it holds, 

W • P{U ^) e U n(v))P(V = v)>bn 

Typically ip n is linear on n so that for a constant d > we will have ip n > dn, and 
the sets U n (v) and V n are such that for constants d± and c?2 it holds, 

P(VeV n )>d u P(U {v) eU n (v))>d 2 , VveV n . (2.12) 

Then the right side of (12.81) has a linear lower bound as desired: 

V„ • Yl P ( U M e Un(v))P(V = v)> {d 1 d 2 d)n. 

veVn 

3. The most crucial assumption of Theorem 12.11 is assumption 1). It states that the 
function u h> l(u,v) increases at least by certain amount 5 on the set where U and 
V take their typical values. The core of the approach is to find U and V such that 
(12. 6p holds. Later in concrete settings, we shall see how (I2.6P is achieved in practice. 
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4. The condition 2) barely states the existence of an uniform lower bound for the 
conditional variance (i.e. independent of v). Some trivial bounds clearly exist, but 
as explained above, ip n has to grow linearly in order to get a linear lower bound for 
Var[L n ]. 

5. The condition 3) is of technical nature. In particular, it holds if U n (v) is a lattice of 
span k Q , i.e. for integers m and u Q 

U n (v) = {u + k , u + 2k a , ...,u + mk }. (2.13) 

As we shall see, this is a typical situation in practice. 

6. The proof is based on the decomposition (12.91) . In all previous papers, the lower 
bound of Var[L(Z)] was achieved by bounding below the (expectation) of conditional 
variance Var[L(Z) \U,V] ( [HI El EOl I2U EZ] ) • This approach often involves martingale's 
arguments (via Hoffding-Azuma inequality), non-trivial combinatorial estimates and 
a generalization of Proposition 12. 1L In this paper, however, we bound the variance 
of conditional expectation Vax(E[L(Z)\U, V]). Although the main idea remains the 
same, the proof is now much shorter and less technical, relying solely on Proposition 

o 



Corollary 2.3 Let, for any v G V n , the setU n {y) be defined asU n {v) = U n C\S n {v), where 
U n C K is a subset independent of v. If the assumptions of Theorem \2.1\ are satisfied, 
then 



<T 2 



Var[L(Z)] > (^-J ■ i> n ■ P(U eU n ,Ve V n ). (2.14) 
Proof. The (I27T4D follows from (gUJ) : 

J2 P {U(v) e U n (v))P(V = V)=J2 P i U (v) £ U n )P{V = V )=J2 P i U ^ ^ = V)P{V 

vev„ t)GV„ fsv n 

Y^P{UeU n ,V = v) = P(U eU H ,Ve V n ). 



2.3 Random transformation and the condition (12.61) 

In order to simplify the notation, we consider the case where U n is an integer interval and 
that, for any v G V n , the fiber S n (v ) is a lattice with span k Q > 1. Thus, for every v G V n 
there exists an integer m (depending on n and v ) so that S n (v) fl lA n is as in (I2.13p . As 
explained in Remark 5, in this case the condition 3) of Theorem 12.11 is fulfilled. 

For any (u, v) G S n , let P( u ,v) denote the law of of Z given U = u and V — v. Thus 

P{u,v){z) = P(Z = z\U = u,V = v). 
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Recall from the introduction that the core of the whole two-step approach is the existence 
of a random transformation 1Z : X n x X n — > X n x X n independent of Z that satisfies (II. 5p . 
In order to make this approach to work, the transformation should be associated with the 
U and V in the following way: for a typical z G X n x X n , the transformation increases 
u(z) by k Q unit and leaves v(z) unchanged. Typically there are many such (random or 
non-random) mappings, but to ensure (I2.6p . the transformation should be chosen so that 
some additional assumptions are fulfilled. Recall that Z is obtained from Z by applying 
a random modification to Z and the additional randomness is independent of Z. As 
mentioned above, the transformation increases U = u(Z) by k Q and leaves V = &{Z) 
unchanged, thus (at least for the typical values of Z), it holds 

u(Z) = u(Z) + k Q , x>(Z) = X>(Z). (2.15) 

In addition, we need the following assumptions to be true: 

Al There exist universal (not depending on n) constants a > and e D > such that 

P(E[L(Z) - L(Z)\Z] > e a ) > 1 - exp[-n a ]. 

A2 There exists universal constant A < oo so that L(Z) — L(Z) > —A. 

A3 For any (u,v) such that v G V n and u G U n {v), the following implication holds: 

If Z ~ P(u,v), then Z ~ P( u +k ,v)- (2.16) 

Remarks: 

1. The assumption Al is the condition (II. 5p explained already in Introduction. 

2. The assumption A2 states that by applying the random transformation, the maxi- 
mum decrease of the score is at most A. This assumption usually holds for trivial 
reasons. 

3. Note that (I2.16P implies (I2.15p . However, the condition (I2.16P is more restrictive and 
(except some trivial cases) to achieve it, the transformation R has to be random. 

4. If U n {v) = U n fl S n (v), then v G V n and u G U n {y) holds if and only if (u,v) G 
S n n(U n xV n ). 

Theorem 2.2 Assume the existence of a random transformation so that for every n, Al, 
A2 and A3 hold. Suppose that there exists a constant a > so that for any (u, v) such 
that v G V n and u G U n {v), it holds 

P(U = u,V = v) > — . (2.17) 
an 

Then there exists a n 5 < oo so that for every n > n 5 the assumption 1) of Theorem \2. 1\ 
is fulfilled with 8 — 4f . 
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Proof. Let v G V„ and u G U n (v). Let Zu v ) be a random vector having the distribution 
P(u,v)- By A3, thus, 

l(u + fc D ,f) = E[L(Z (UtV) )]. 

Hence 

Z(tt + k Q ,v) - l(u,v) = E[L{Z {u>v) )) - E[L(Z {u ^ v) )} = E[L(Z (UtV) ) - L{Z^ v) )} 
= E(E[L{Z{ U)V -)) — L{Z( u ,v))\Z{ u ,v)\)- 

Let B n C X n x X n be the set of outcomes of Z such that 

{E[L(Z) - L(Z)\Z] > e } = {Z G 5 n }. 

By assumption A2, for any pair of sequences z, the worst decrease of the score, when 
applying the block-transformation is —A. Hence 

E(E[L(Z [UtV) ) - L(Z {U>V) )\Z M )) > eP(Z M G B n {e)) - AP(Z M £ B n {e)). 

By Al, P(Z g B n ) < exp[-n a }. Therefore 



P(Z M £ B n ) = P{Z £ B n \U = u,V = v)< U . TT K \ r ' < anexp [-n 

where the last inequality follows from ( 12 . 17f) . Take now so big that for any n > , we 
have 

e (l — an exp[— n Q ]) — Aanexp[— n a ] > — . 
Hence, for any n > and for any (u, v) such that v G V„ and u G U n (v), we have 

/(■u + A; ,f) — l(u,v) > e D (l — an exp[— n"]) — Aanexp[—n a ] > (2.18) 



2.4 Covered previous results 

Before turning into new results presented in the subsequent sections, let us briefly mention 
how the random transformation as well as the associated random variables were defined 
in already obtained results: 

• In |8j, the random variable U is the number of matching replica points while V is 
a constant. Roughly speaking, a letter X, is a replica point if it has a neighbor- 
hood that matches exactly with the periodic sequence (i.e. it has the same periodic 
pattern). The replica point itself can or cannot match, and it is shown that the 
number of matching replica points has variance proportional to n. In |8j the random 
transformation is not explicitly defined, but one can take it as uniformly choosing a 
replica point with prescribed color and change its value. 
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• In j6] the random sequence X is built up on the alphabet {0, 1, a}. The random vari- 
able U is the number of a's in X, while V is a constant. The random transformation 
is hidden in the so called drop-scheme of random bits, used to construct the sequence 
X 01 which is the subsequence of X only having 0's and l's. Roughly speaking, the 
drop-scheme of random bits consists on, starting from a binary random sequence of 
length two, to flip a coin and to add the resulting symbol into the previous sequence 
at an uniformly chosen location, so increasing the length, until reaching a length 
n-U. 

• In [20] the scoring function is such that 5(1, 0) = 5(0, 1) = 0, 5(0, 0) = 1 and 
5(1, 1) G M. The random transformation consists, in X, to uniformly choose a block 
of length five, to take one of its symbols out and to add it to a uniformly chosen 
block of length one. The random variable U is the number of blocks of length two 
and of length four, and V is the number of blocks of the other lengths. 

• In [24], both sequences typically consist of many zeros and few ones. The random 
transformation, uniformly at random picks an arbitrary one in Z and turns it into a 
zero. The variable U is the number of ones in Z, V is a constant. Hence, this case 
is essentially the same as considered in the Section [3] with |A| = 2 and Theorem 13.21 
nicely generalizes Theorem 2.1 in [24] . 

• In [37], the random transformation and the random variables (U,V) are defined to 
be as in Section HI 

3 Optimal score of random i.i.d. sequences 

Our first application deals with the general scoring scheme as introduced in Section 11.11 
Thus let A be a finite alphabet and X, Y be independent i.i.d. sequences so that any 
letter has positive probability, i.e. 

P(c) := P(Xi = c) > 0, Vc G A. 

Clearly now X n = A™. Let 5 : A x A — > R + be a scoring function. Let A < oo be 
the maximal value of the scoring function, i.e. max^j S(a, b) < A. We naturally assume 
that the gap price does not exceed A, i.e. 5 < A. Now, it is easy but important to 
observe that changing one letter in the sequence X, say X\, decreases the score at most 
by A units. Indeed, if X\ was not included any optimal alignment, then changing it does 
not decrease the score. If an optimal alignment includes X%, then after the change, the 
previous alignment (which now need not to be optimal any more) scores at most A units 
less than before the change. And the new optimal alignment cannot score less. 

The random transformation. In this setup, the random transformation is the follow- 
ing. Recall that Z stands for the pair of sequences (X, Y). We choose two specific letters 
a and b from the alphabet A. Given the pair Z such that at least one of the sequences 
contain at least one a, we choose a letter a from Z with uniform distribution and change 
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it into the letter b. Hence Z and Z differ from one letter only and as just explained, 
the maximum decrease of score is at most A, i.e. L(Z) — L(Z) > —A i.e. the condition 
A2 is satisfied. Choosing such a transformation is motivated by the following result (c.f. 
Theorem 5.1 and Theorem 5.2 in [2"E]): 

Theorem 3.1 Suppose there exist letters a, b G A such that 

J2P(c){S(b,c)-S(a,c))>0. (3.1) 

Then, there exist constants 5o > —oo, e Q > 0, a > and no < oo such that 

P(E[L(Z) - L(Z)\Z] > e ) > 1 - e~ an (3.2) 

given 5 < 5 and n > n . 
Remarks: 

1. For the two-letter alphabet A = {a,b}, condition (13. 1 j) says 

(5(6, a) - S(a, a))P{a) + (S(b, b) - S{b, a))P(b) > 0. 

When S(b,b) = S(a,a) ^ S(a,b) = S(b,a), then ( 13.11) is satisfied if and only if 
P(a) 7^ P(b). For, example when S(b,b) = S(a,a) > S(b,a) = S(b,a), then ( 13.1 ft 
holds ifP(a) < P(6). 

2. The condition 5 < 5q means that the gap penalty —5 has to be sufficiently large. 
Intuitively, the larger the gap penalty (smaller the gap price), the less gaps in op- 
timal alignment so that the optimal alignment is closer to the pairwise comparison 
(Hamming distance). Some methods for determining a sufficient do, as well as some 
examples, are discussed in |26j. We believe that the assumption on S can be relaxed 
so that Theorem 13.11 holds under more general assumptions. 

In this section, we shall assume that there exists letters a, b G A so that the random 
transformation satisfies Al (equivalently, (I3.2p ). We shall show that all other assumptions 
of Theorems 12.11 and 12.21 are fulfilled. We start with the general case |A| > 2, the case 
|A| = 2 will be discussed in the end of the present section. 

3.1 The case |A| > 2 

Let A = {a, b, c\, . . . , q}, where I > 1. The letters a and b are the ones used in the random 
transformation. Let 

l-P{a)-P{bY 3 

With N a and Nf, being the random number of a's and b's in X±, . . . , X n , Yi, . . . , Y n , we 
define the random variables 

U := N b , V:=N a + N b . 
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For any z G X n x X n , thus u(z) and b(z)—u(z) stand for the number of fe's and the number 
of a's in both strings, respectively. The random transformation applied on z changes an 
randomly chosen a into a letter b, hence the transformation increases u(z) by one, whilst 
v(z) remains unchanged. 

Clearly the possible values for U and V are {0, . . . , 2n} and the only restriction to (U, V) 
is that U < V. Hence, in this case S% = S% = {0, . . . , 2n}, 



and for any v, 



S n := {(u, v ) G {0, . . . , 2n} x {0, . . . , 2n} : u < v} 

S n {v) = {0,...,v}. 



For any z G X n x X n , 

P{Z = z\U = u,V = v) = P(Z = z\N a = v -u,N b = u) 



l m,j (x) 







( 2n V 

\u v—u 2n—vJ 



, if u(z) = u, t)(z) 
else 



(3.3) 



where rrij(z) is the number of Cj-colored letters in z. 



The sets U n {v) and V n . Note that U ~ B(2n, P(b)) and V ~ B(2n, P(a)+P{b)). Also 
note that for any v > 0, 

[/(„) ~ B(v,p b ), 



where p& = w^pm ■ Let p := P(a) + Pib) and let 



2np — v 2n, 2np + v 2n 

:= - y/v, Vp b + y/v\ n S n (v). 

Now it is clear that the condition 3) of Theorem 12.11 is fulfilled with k Q — 1. 



With the help of Chebyshev's inequality, it is straightforward to see that for any n, 

P(Y G V n ) > 1 - p(l - p), P(U {V) G Z4(^)) > 1 - p 6 (l - Pb ). (3.4) 

Clearly, there exists v {p b ) so that vp b + y/v < v, whenever v > v Q . Thus, there exists n\ 
so that for every n > ri\ and v G V n , it holds that v > v Q . In particular, for every n > n\ 
and for every pair (u,v) such that v G V„, u G I4 n (v), it holds v > u. 

Lemma 3.1 There exist an universal constant a > and n2 > n% such that for any 
n > n 2 , for any v G V„ and u G U n {y), it holds 

POJ = u V = v) > — . (3.5) 
an 
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Proof. The proof is based on Lemma [2731 Since Ui v \ ~ B(v,pb), by (I2.3P there exists v 
and a positive constant b\ so that for any v > v a , 

P(U [V) = u)> — ^, Vu G U n {v). (3.6) 

biy/V 



Secondly, since V ~ B(2n,p), there exists 722,1 so that for every n > 722,1, 

P(V = v) > -±==, WveV n . (3.7) 
b 2 y 2,n 



Take now 772 > 772,1 so big that for any n > n 2 it holds 2np — y2n > v Q . Then, for every 
v G V„, (13. 6 p and (13. 7p both hold. Thus, for any n > n 2 , v G V n and w G U n (v), we have 

P(tf = «, V = v) = P(U (V) = u)P{V = v)> 1 > 1 > — , 



where the constant a can be taken as 2bib 2 y/p + 1. 



Lemma 3.2 There exists a finite n 3 and a constant d > such that n 3 > n 2 and for 

every n > n% and v G V n , it holds 



Vai[U iv) \U {v) G U n {v)} > dn. (3.1 



Proof. From Lemma [2.31 we know that there exists c\ and v , so that 

Vai[U iv) \U {v) G U n {v)\ > Cl v, (3.9) 

provided v > v Q . Let 713,1 be such that for every n > 723,1 2np — \/2n > v Q . Then, for any 
n > 77,3,1 an any v G V n , we have that v > v so that (I3.9P holds and 



Var [*7 ( „) \U [v) G Un{v)\ > c x (2pn - V2n). (3.10) 



Finally take 77.3 > 77 3) i so big that for a constant d > 0, Ci(2pn — y2n) > dn, provided 
n > n 3 . 



Finally we prove A3 for that particular model. 

Lemma 3.3 Let (u,v) G S n be such that v > u. Let Z ~ P(u,v) Then Z ~ Pf u +i,v) 
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Proof. For any z G X n x X n , let the set A(z) consists of possible outcomes after applying 
the random transformation to z. Since the transformation changes an a into b, the number 
of different outcomes equals to the number of a's in z, thus |*4.(<z)l = ®( z ) ~ Since 
the transformation picks the letters uniformly, we obtain that for any z G A(z), 

P{Z = ~z\Z = z)= 1 (3.11) 
t>(z) -u(z) 

Let the set B(z) consist of all these pairs of strings that could result z after the transfor- 
mation: B(x) := {x G X : x G A(x)}. Since before transformation one of o's in z was a, 
clearly \B(z)\ = u(z). Recall that U and V are the functions of Z. Let Z ~ -P( u ,^). We 
aim to find 

p(Z = z) = P(Z = z\U = u,V = v)= = z\Z = z)P(Z = z\U = u,V = v). 

(3.12) 

Let 

S(u,v) := {z : u(z) = u, K){z) = v}. 
Clearly the right hand side of (13.121) is zero, if 

S(u,v)nB(z) =0. 

This simply means that the string z does not satisfy at least one of the following equalities: 

u(z) = u + 1, t)(z) = V. 

Let us now assume that z satisfies both equalities above. In particular, \B(z)\ — u + 1 
and any element in B(z) is such that the number of o's is u and the number of a's is 
v — it and the number of all Cj equal to that of z. i.e. rrij(z) = rrij(z) Wz G B(z). Clearly 
B{z) C S(u,v). By 0S3D, thus 

P(Z = z\U = u,V = v) = P ( Z = l \ z = z ) p ( z = z\U = u,V = v) 

zet3(z) 

-l l 



v — u \v — u u 2n — v J 

ii - I / 2// \ 1 -j^j 

u\(u +l)(v - u)\(2n - v)\ -pj mj (z) 
(v-u)2n\ ll Qj 

(u + l)\(v-u- \)\{2n-v)\ mj(s) 
2n\ ^ 

2n x ~ j 

u — u — 1 u + 1 2n — v 



m, (z) 

3=1 



v — u\v — u u 2n — v 



irij (z) 
1j 



nmAz) 
1j 



3=1 
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By (O, Z ~ P (u+ i,„). 



Theorem 3.2 Assume that the random transformation satisfies Al. Then there exists 
an universal constant b > and n 6 < oo so that for every n > n%, it holds 

Var[L(Z)] >b-n. (3.13) 

Proof. Let us first check the assumptions of Theorem 12.21 Al holds by the assumption. 
As explained above, the random transformation is such that A2 holds. Let now n 2 be as 
in Lemma [3.1 1 and as in Lemma [3.21 Recall that n\ < n 2 < n% < oo. Hence, for any 
n > n 3 , ( 13. 5 j) and ( 13. 8 p hold; moreover, from (I3.4j) . it follows that with d% = 1 — p(l — p) 
and d 2 = 1 — ]?b(l — the inequalities ( I2.12p hold and for any pair (u, v) where v G V n 
and w G U n (v), we have that v > u. The latter ensures that the random transformation is 
possible and Lemma 13731 now establishes A3. Therefore, for every n > 713, the assumptions 
of Theorem 12.21 are fulfilled and so there exists > n.3 so that for for every n > n.5, the 
assumptions of Theorem 12. II hold with 5 — y. 

We now apply Theorem 12.11 As just showed, the assumption 1) holds for any n > n^; as 
explained in Subsection 13. 1[ the assumption 3) holds with k a = 1. By (13.81) . tp n = dn. By 
Theorem I2.1[ thus, the lower bound (I4.20p exists with 

_ e 2 Q did 2 d 
4 



3.2 The case A = {a, b} 

This case is easier. The only random variable is U, formally we can take V = In. Then 
flU is 

(2n\ _1 

P(Z = z\U = u) = P{Z = z\U = u,V = 2n)= (J I {u(z)=u} . 

Now take V n = {2n} and 

U n = U n (2n) = [2np b - V2n, 2np b - \p2~n\ n S v n . 

Then everything holds true: there clearly exists n\ so that u < v = 2n, whenever n > ni 
and u G U n \ the bound ( 13 .5p holds with a = b\ (and, in fact n~2 instead of n -1 ); the 
proof of Lemma [3721 is simply ( 13 .9p and the proof of Lemma [3731 holds true with o(z) = 2n. 
Thus Theorem 13.21 holds. 
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4 The length of the LCS of random i.i.d. block se- 
quences 

In this section we are interested in the fluctuations of L(Z) for the score function as in 
(11. 2p . where Z = (X,Y) are binary sequences having a certain random block structure. 
This random block structure was first considered in [37]. In the present article, we con- 
sider a random block structure which is a generalization of the model in [37]. We are 
able to show that the length of the longest common subsequence of two sequences having 
this random block structure grows linearly by following the general two-step approach. 
Therefore, we confirm in this setting Waterman's conjecture. 

Ofte in this section x, y stand for binary strings of length n > 0. We start by get- 
ting a bit more familiar with the LCS of x and y. First, note that the LCS of x and y 
and the alignment generating it are not necessarily unique, but its length is unique. 

Example 4.1 Let x = 100101100001101 and y = 111000010101110. The length of the 
LCS of x and y is L\^{x,y) = 10. A candidate for the LCS of x and y is the string 
1000100111. This LCS could have came (but not exclusively) from any of the following 
alignments: 
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Another candidate for an LCS of x and y is the string 1000000110. 

We now introduce the random block model: 

4.1 The 3- multinomial block model 

We say that a block of zeros of length m G Z + in a; is a run of 0's of maximal length 
between two ones, except for the block of zeros at the beginning of x, which only has a 1 
inmediatly to its left. We consider the analog convention for a block of ones of length m in 
x, as well as for any binary sequence. Let I > 2 and g l3 q 2 , q% G (0, 1) be parameters such 
that qi + q 2 + q-z = 1. Let (Wk)k an d (W' k ) k be two i.i.d. sequences of random variables 
taking values on {/ — 1, 1, 1 + 1}, independent of each other, with distribution 

P(W k = h) = P{W' k =li) = qi , V k > 1, i G {1, 2, 3} 

where l\ :— I — l,/ 2 : = I and l 3 := I + 1. Let {uik)k be a realization of (Wfc)&. Let us 
construct x°° = XiX 2 x 3 ■ ■ ■ an infinite binary sequence depending on (Wk)k as follows: 
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We choose i? G {0,1} with probability 1/2, independently from everything else. Then, 
we built up the first block in x°° as a block of t?'s with lenght w±, the second block 
in x°° as a block of (1 — i?)'s with length tu 2 , the third block in x°° as a block of $'s 
with length w 3 , and so on. We built up, in a completely analog way, the sequence 
y°° based on a realization (w' k ) k of (W^) fc . After this, for a given n > 0, let us define 
x((w k )k) ■= x°°[l,n] := x 1 ---x n and y((w' k ) k ) := y°°[l,n] := yi---y n , namely the first 
n- bytes of the infinite sequence x°°, respectively y°°. Note that the last run of the same 
symbol in x((w k ) k ) (resp. in y((w' k ) k )) is not a block according to our definition, or though 
its length r is such that r G {1, ...,/ + 1}. Naturally, x((w k ) k ) (or equivalently y((w k ) k )) 
induces the set X n C {0, l} n of binary sequences having blocks only with length either 
I — 1, Z or Z + l and a last run of the same symbol with length r such that r G {1, ...,/ + 1}. 
Let us denote by X := x((W fc ) fc ) := X 1 ■ ■ ■ X n (resp. by Y := y((W' k ) k ) := Y 1 ■ ■ -Y n ) the 
associated random binary sequence of length n whose realization is an element of X n . The 
process of allocating the blocks can be seen as independently drawing balls of 3 different 
colours from an urn, where a ball of colour % has probability g« to be picked up, i — 1,2, 3. 
That is why we call this the 3-multinomial block model. For k G {/ — 1,1,1 + 1} and 
x G X n , let bk(x) be the number of blocks of length k in x, and denote B k := b k (X) the 
associated random variable. 

Example 4.2 Take I = 3 and let W 1 = 2, W 2 = 3, W 3 = 2, W 4 = 4, W 5 = 3, . . . Suppose 
that we get d = 0, so then x°° = 00111001111000 .... For n = 13, we get the sequence 
x = 0011100111100 such that b2(x) = 2,b 3 (x) = 1 and b±{x) = 1, with a rest at the end 
of length r = 2. 

Let us define the following three new random variables: 

T := + + (4.1) 

U := Bi — Bi-i — Bi + i (4.2) 

R := n-(ZS, + (Z + l)S, +1 + (Z-l)S,_ 1 ). (4.3) 

Given (bi-i(x), bi(x), bt+i(x)), let us denote by (t(x),u(x),r(x)) the solution of the linear 
system 

t(x) = bi(x) + bi-!(x) + bi+i(x) 
u(x) = k(x) - 6j_i(a;) - b l+l (x) 

r( x ) = n — ( Z + + !) ^+i(^) + — 1) &z-i(^) )• 

The other way around, given any realization (t, u, r) G 1? of (T, U, R), let us denote by 

(&i_i(f , u, r), bi(t, u, r), b l+1 (t, u, r)) 

the solution of the linear system: 

&,_i(f,u,r)\ / (2Z + l)/4 -1/4 \, v /-(n-r)/2\ 
6,(t,«,r) = 1/2 1/2 ; + . (4.4) 

b l+1 (t,u,r) J \-(2Z-l)/4 -1/ 4 ;V M / y (n-r)/2 / 

This means that we have a one-to-one correspondence between the random variables 
(Bi_i, B h B l+ i) and (T, [/, R), which will be often used in what follows. 
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The 3-multinomial distribution. We can compute the distribution of X by taking 
into account its block structure. In order to do so, let us define the function 

p(r) := P(W > r), for r G {1, ...,/ + 1}. 

Clearly, p(r) = 1 when r G {1, ...,/ — 1}, but p(l) = q 2 + q 3 and p(l + 1) = g 3 . Now, we 
see that for any x G X n it holds 

P(X = x) = l q t lix) q b 2 2ix) q b 3 3ix) p(r(x)), (4.5) 

where the factor | is needed because to every fixed block-sequence corresponds two se- 
quences in X n , both having the same probability (it is the choosing of the colour of the 
first block with probability 1/2). Moreover, e.g. by the urn analogy, we find that the joint 
distribution of (-B/_i, Bi, Bi + \) can be computed as following: given b\, 62, b 3 G N it holds 

P(B^ = b u B l = b 2 , B l+l = 63) = J2 P ( X = x ) = (\ + I" + b h ) &&&P(r), 

xeSQuMM) V 1 2 3 / 

(4.6) 

where 

S{bi,b 2 ,b 3 ) := {x G X n : b^x) = b u bi{x) = b 2 ,b l+1 (x) = b 3 } 

and r = n — {I — l)b\ — lb 2 — (I + l)b 3 . Note that the factor | disappears. So, combining 
(14. 5 p and ( 14. 61) we naturally get that for x G X it holds 

P(X = = 61, S, = b 2 , B l+1 =h) = \ ^ ^ l S ( blM M)(x)- (4-7) 

Note also that, from (I4.4p . we can even compute the joint distribution of (T,U,R) as 
follows: 

p{u =«.*■=«.«->= ( 6l _ l(( ,„, r) M( ,; r) wtlU , r) )^^^^w). 

(4.8) 

4.2 Fluctuations of the length of the LCS in the 3-multinomial 
block model 

Let Z = (X, Y) be a vector of binary sequences, where each component has the previously 
introduced random block structure. Let us identify U defined in (14.21) with the random 
variable u(Z) as well as the vector (T,R) with the random variable o(Z). Therefore, 
in what follows (U, V) = (T, U, R) and its support S n consists of triples (t, u, r) so that 
P(U = u,T = t, R = r) > 0. We would like to use Theorem 12.11 so we must look for sets 
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U n and V n such that the conditions 1), 2) and 3) are satisfied. For any c > 0, define 




(<?2 - 9i - 93) - cv 7 ^, -(92 - <?i - 93) + c\/n n Z, 



— c-\/n, — V cy/n n z, 



V c := (t 



; c x{i,...,/ + i}) ni sr. 



Note that the notation (resp. 7^ c and V°) means that the set U n explicitely depends 
on the constant c > 0, and has nothing to do with the notation for the complement of 
a set. Recall the right hand side of f |2. 14j) and the 2. Remark after Theorem 12.11 A 
first observation is that, uniformly on n, P(U G U%, V G V£) is bounded by below by a 
constant: 

Lemma 4.1 There exist universal constant c > (not depending onn) and uq < 00 such 
that for every n > no it holds 



The proof is a rather straightforward application of large deviation techniques, and there- 
fore it is contained in the Appendix. We shall also need the following lemma (proven in 
the Appendix): 

Lemma 4.2 There exist an universal constant a > and n\ G (no, 00) such that for 
every n > n\ and (u, v ) G S n R (U n x V n ), it holds 



In what follows, we shall take U n := and V„ = V^, where c > is as in Lemma 14.11 
and we shall take n > n . Recall the definition U n {v) := U n fl S n (v), for v G V n . We show 
now how the conditions of Theorem 12.11 are fulfilled: 

Condition 3). Lets us start by showing that u G U n (v) implies u + i ^ U n {v) for 
i = 1,2,3. Indeed given u G U n (v), it is enough to realize that bi_ 1 (u + i,v) is not an 
integer if i — 1,2,3 but bi^i(u + 4, v) is an integer, where &i_i(u, v), bi(u, v ), bi + i(u, v ) are 
the integer solutions of the system (14. 4p . Next, from Lemma [4. 2\ it follows (among other 
things) that, if n is big enough, w + 4 G S n (v) for every v G V n and u G U n (v), which finally 
implies that U n (v) is of the form (12.131) with k Q = 4 so that the condition 3) of Theorem 
12. II holds with k a = 4. Let us explain all the last argument. For every n > nx, v G V„ and 
u G U n (v), we have that necessarily u + 4 G S n (v), so (u, t> ) G 5„, therefore there exists at 
least one possible outcome x G X n so that u(x) = w and t)(a;) = v. Moreover, from (14.91) . 



P(UeU°,VeV°) >0.9. 



bi,(u,v) — g— < ay/n and bi (u,v)>l, Vi = 1,2,3. 
/1 



(4.9) 
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it follows that bj^iu^v) > 1 for every i = 1,2,3. Thus, in x there are at least one block 
from every size {1 — 1,1,1 + 1}. Deleting from x one bit from a block of the length I + 1 
and adding one bit to the block of length I — 1, we turn both them to the blocks of the 
length I. This transformation does not change the number of blocks and the sum of the 
lengths of all blocks, so we have another possible outcome of X, say x with u(x) = u + 4 
and t>(x) = v . Hence (u + 4, v) G «S n as well. If we keep on doing so, there exist an integer 
m(v,n) > 1 and u (v,n) G Z such that 

U n (v) = {u (v,n) + 4i : % = 1, . . . , m(v, n)}. (4-10) 

It is not hard to see that, for every n > ri\ and v G V n , the integer m(v,n) satisfies 

ZcJn n , . 2cJn n . i , , . 

— 2 < m(v, n) < — — + 2. (4.11) 

4 4 

Condition 2). Recall that {u, v) = (t,u,r). Let Uu\ denote a random variable dis- 
tributed as U given V = v, namely for every z G Z it holds 

P(U (v) = z) = P(U = z\V = v). 

For every n > n\ and t> G V n , let us define: 

p n (i) := P(U( V ) = u (v,n) + 4i \ U( v ) G U n ), i = 1, . . . ,m(v,n). 

The following lemma shows that the ratio p„(i + l)/p n (i) tends to one with speed 0{n~^). 
The proof, given in the Appendix, is heavily based on the following well-known inequali- 
ties: 

— — < ln(l — x), for < x < 0.5 
\n(l + x) < x, for x > -1. (4.12) 



Lemma 4.3 T/iere exists an universal constant K < oo and n 2 > ni snca taat /or every 
n > n 2 and v G V n it holds, 

K p n (i + 1) K , , t . 

1- — <^— ^<1 + — , i = l,...,m(i;,7i . 4.13 



Recall (12. 7p . 2. and 4. Remark after Theorem 12.11 We are now ready to prove that the 
(conditional) variance of U increases linearly, i.e. condition 2): 

Lemma 4.4 There exist an universal constant d > and n 3 > n 2 so that for every 
n > n 3 and for every v G V n it holds, 

Var[U\U G U n , V = v] = Var[U (v) \U (v) G U n \ = Var[C/ ( „) | U (v) G U n {v)\ > dn. (4.14) 
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Proof. Let n > n 2 and v G V n . Recall (I4.10p and (14. lip . There exist constants < d\ < 
d 2 so that for every n big enough, say n > > n 2 , it holds 

, r- 2c\/n 2c\/n , r - 

d 2 V^ < -j- - 6 < —J- + 2 < d 2 ^i. 

From (14. lip , it follows that for every n > and v G V 

d\\fn < m — 4 < m < d 2 \fn. (4-15) 
Take n > n 3 . From (I4.13p . it follows that 

K pJi + 1) K 

V n Pn[l) V n 



for i = 1, . . . m — 1. Hence, for every z, A; G N such that 2 + A; < m, it holds 

K_\ k < Vnii + k) = Pn (i + 1) _ Pn (i + 2) _ _ _ Pn (i + k) < ( l + ^ " 
s/n) ~ p n {i) p n {i) Pn{i + 1) Pn{i + k - 1) _ \ v 7 ™ 

Recall (I4.12p . so that from the last inequality we get 

exp < ^1 < exp (*k) . (4.16) 

Thus, for every 1 < z, j < m, we have 

p n (i) ( $K .A /3tf \ /3/i ,\ ^ 

< exp — =11 -j\ < exp — =m < exp d 2 =: £7. (4.17) 



From (I4.17p . it follows that p n (i) < (mxnip n (i))E, so that 

m 

1 = ) p n (i) < m(minp n (i))E 

i=l 

and we obtain that for every % = 1, . . . , m, it holds 

Pn(«) > (noinp n (i)) > — = > 7= - 

Now, the variance can be estimated as follows. Let u := G W n ]. Without loss 

of generality, let us assume that u{v,n) < u a + 4( !1 ^) = w + 2m + 2. Then for every 
i > ^p> it holds that \u Q + 4z — u\ = u Q + 4z — u > m — 2. Then 



Var[[/( v )|C/ (w ) G Z4] = + 4i - u) 2 p n {i) > ^ (m q + 4z - u) 2 p n {i) 

1=1 ,> 3m 

- 4 

1 m - 2 2 — — — > — x —n. 

4 A ; Ed 2v fo ~ AEd 2 
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Condition 1) . The strategy is to look for a random mapping which satisfies assumptions 
Al, A2 and A3, so that we check condition 1) by applying Theorem 12.21 Recall that to 
apply Theorem 12.21 we additionally need that every point in the set S n D (U n x V„) has 
to have sufficiently big probability so that the condition (I2.17P is fulfilled. The following 
lemma, also proven in the Appendix, shows that the defined sets U n and V n indeed satisfy 
this additional condition: 

Lemma 4.5 There exist an universal constant a > and > 713 such that for any 
n > n 4 and (u, v ) € S n fl (U n x V n ) it holds 

POj = u,V = v) > — . 

an 

Let us finally introduce the random transformation TZ: Take z = (x,y) 6 X n X X n . Then 
in x, we choose uniformly at random a block of length I — 1 (among all the > 1 

available blocks of length / — 1) and turn it to a block of length /. At the same time and 
independent from our previous choice, we choose uniformly at random a block of length 
/ + 1 (among all the bi + \(x) > 1 available blocks of length I + 1) and also turn it to a 
block of length I. We do not perform any change in y. Following our initial convention, 
z := 7Z(z) = (x,y) 6 X n x X n is the sequence after applying this transformation. 

Example 4.3 As in a previous example with I = 3 and n = 13, let us take x = 
0011100111100 such that b2(x) = 2,63(0;) = 1 and b^{x) = 1, with a rest at the end 
of length r = 2. In x, there are only two blocks of length 1 — 1 = 2 to pick from, each with 
probability 1/2, namely 00 (most left one) or 00 (following most left one), and only one 
block of length l + l = 4 to pick from, with probability 1, namely 1111. Let us suppose that 
1Z picks up 00 (most left one) and 1111, then x, will look like this x = 0001110011100. 

Note naturally that bi(x) = 6/(x) + 2, = bi-i(x) — 1 and bi + i(x) = bi + i(x) — 1, so 

that for k Q = 4 the condition ( I2.15P is satisfied. 

We will prove that TZ satisfies assumptions A3 and A2, but we do not prove in this 
paper that 7Z satisfies assumption Al, because it would be too long (see 2. Remark after 
( I2.16P ) and the proof deals with another issues of random sequences comparison, which 
are different from the fluctuations ones we try to be focused on along the present article. 
It means that our main result, i.e. Theorem I4.1[ delivers the linear fluctuations result 
assuming that Al is fulfilled. This is not restrictive, since in [37] assumption Al was 
already proven for the special case gi = g 2 = g 3 = 1/3 (as well as the linear fluctuations 
result). For the sake of completness, we include here the before mentioned result with 
our current notation: 

Lemma 4.6 Let q\ = q 2 = ^3 = 1/3. There exist n < 00 and a constant a G (0, 1) not 
depending on n but on I, such that for every n > no the event 

E[L(Z) - L(Z)\Z] > e 

happens with probability bigger than 1 — exp[— n a ]. 
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Remark 4.1 (for readers who want to dig in the details of |37| ) As we have men- 
tioned, the LCS of two sequences might have associated many different alignments, which 
we call optimal alignments. Lemma 4-2.1 in [37] showed that the set of realizations of 
X and Y such that their optimal alignments leave out at most a proportion go of blocks, 
where go > 9(1-1) > have probability exponentially close to one as n — )■ 00. The entire chap- 
ter 4 in jjS 7 ^ is dedicated to prove results of this type for a set of realizations of X and Y 
and their optimal alignments. Then, in Lemma 4-7.1 in |37f it is shown Lemma \47b] as in 
the following form: the set of realizations of X and Y such that their optimal alignments 
satisfy E[L(Z) — L(Z)\Z] > e x has probability exponentially close to one as n — >■ oo ; for 
an arbitrary €\ > 0. It is important to note that in f£7|/, all the extra conditions of Lemma 
4-7.1 and the extra work through the chapter 4 o,nd chapter 6 are devoted to relate the 
value of t\ with the smallest possible length of the blocks I > in order to get a sharp 
result for the order of the fluctuations of L(Z). This relation is obtained in terms of 
an optimization problem which can be explicitely solved for this particular 3-multinomial 
model where gi = g2 = g3 = 1/3. 

The authors are working on a separate article about how to generalize Lemma \4.6] to the 
case gi,g2,g3 £ (0, 1) such that q x + g 2 + g 3 = 1 (namely, the present and more general 
3-multinomial block model). 

Assumption A3. Recall that A3 presupposes that for any (u,v) 6 S n n {U n x V n ), 
bi-x(u,v) > 1 and bi + \(u, v) > 1, which follows from (14. 9p . The following lemma proves 
A3 : 

Lemma 4.7 Let (u,v) G S n be such that fy_i(it, t>) > 1 and bi + i(u,v) > 1. If Z ~ P( u ,v), 
then Z ~ P(u+a, v ) ■ 

Proof. The random variables U, T and R are independent of Y. Hence, P( u ,v) — Pf u v ) x 
P y , where Pf uv \ is the conditional distribution of X given {(U, V) = (u,v)}, P y is the 
law of Y (actually P x = P y ) and x stands for the product measure. Also the block 
transformation applies to X, only. Thus, it suffices to show that if X ~ P? uv y then 

X ~ P( U+ 4 V )- For proving this, we follow the approach in Lemma [3.31 

The first step of the proof is to explicitly compute an expression for Pf uv ) = P( utr y By 

f )4.7p . we have that for any x 6 S(bi_i(u,v),bi(u,v),bi +1 (u,v)) it holds: 

P(u,v)(x) = P(X = x\U = u,T = t,R = r) 

= P{X = x\Bi-t = bi-x(u,v),Bt = bi(u,v),Bi + i = b i+1 (u,v)) 

=-( 

2 \bi_i(u,v) bi(u,v) bi +1 (u,v) 

The second step of the proof is to actually compute the distribution of X. For that, 
we need to investigate the effect of the block-transformation on the distribution of X. 
Let us fix x G X and denote b\ := 6/-i(x),6 2 := bi{x) and 63 := and (u, v ) its 



(4.18) 
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corresponding triple. Let us define A(x) the set of all strings that are possible outcomes 
after applying the block transformation to x, namely if x G A(x) then necessarily 

bi-i(x) =bt-l, bi(x) = b 2 + 2, b i+1 {x) = b 3 - 1. 

However, not every string y G X n such that bi-i(y) = b\ — l,bi(y) =62 + 2 and 
h+i(y) = bs — 1 belongs to A(x). By using (14.41) . it is straightforward to see that triple 
(fy_i(x), bi(x), bi + x(x)) corresponds to the triple (w + 4, v). Since the block-transformation 
picks up blocks uniformly, then after applying it to x, every element of A(x) has the same 
probability to occur. Formally, 

P(X = x\X = x) = { 6 n l \l G A ^ (4.19) 
ID otherwise 

where 9 G (0, 1] is a constant which depends on x only through (bi, 62, ^3) (or equivalently 
through (u,v)). The block-transformation only changes blocks of length I — 1 and I + 1, 
so 9 = l/(6i • b 3 ). The last ingredient is to define the set B(x) := {x G X : x, G A(x)}. 
The cardinality of this set is 

\B{x)\=^ r 



because, after the block-transformation, each block of length / could have eventually came 
from two previous shorter or longer blocks, so the (^jf^), and can be either of l's or of 
0's, so the 2. 

To end the proof, let us consider X ~ P( u ,v)- Then, we get the corresponding triple 

v), bi(u, v), b i+1 (u, v)) 

depending only on (u,v). To keep the notation light in what follows, let us call b\ := 
bi^iiu, v), b\ '■= bi(u,t,v) and 63 := bi + i(u,v). We aim to find P(X — x) = P(X = 
x\U = u,T = t, R = r). Note that for every x G X n , there are two possibilities: either 
B(x)DS(u,v) =0or 

B{x) c S{u,v). 

The second case holds if and only if 

&,_!(£) = b\ - 1, k(x) = b\ + 2, b l+1 (x) = b* 3 -l 
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or, equivalently x G S{u + 4, v). In this case, by (14.181) and (I4.19p we have: 

P{X = x\U = u,T = t,R = r) = Yl P(X = x\X = x)P(X = x\U = u,T = t,R = r) 



E ^ E P(X = x\U = t,T = t,R = r) 
\B(x)\ 



— Kb, 
xeB(x) 1 3 



If t 



2b\b* 3 \b\ b* b* 3 
1 / t \~ 1 „/&| + 2 



26*6* \ b\ b\b\) \ 2 
1 / / 



2 V&I - 1 b* + 2 6* - 1 



Assumption A2. This assumption means that in the worse case, the length of the LCS 
decreases in A units after the block-transformation. Let z = (x, y) be a realization of Z, 
wi-i be the block of length / — 1 and be the block of length I + 1 in x that 1Z has 
chosen. Note that the decrease in the length of the LCS comes from the following fact: if 
in every optimal alignment producing the LCS of x and y all the bits of W[ + i are aligned, 
then it is clear that deleting one bit of wi + \ will decrease the length of the LCS of x and 
y only by 1. Later, by adding a new bit in wi-\, we cannot get an even lower length of 
the LCS of x and y (in the worst case, we stay the same). Therefore, A = 1. 

Example 4.4 Takel = 2,x = 11100010101101 andy = 11101100101000. ThenL M (x,y) = 
10 could be represented by the alignment 



X 




1 


1 


1 













1 







1 





1 


1 









1 




y 




1 


1 


1 









1 


1 








1 





1 


















Lu(x,y) 
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Suppose thatIZ deletes from the block = 111 (first block, from left to right) one symbol. 
The minimum gain for the length of an LCS is when TZ adds the extra symbol either to 
the fifth block in x of length one (from the left to the right): 
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Lu(x,y) 
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to the sixth block in x of length one (from the left to the right): 
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Lu(x,y) 
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In both cases, we get L 14: (x,y) = 9. 
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We now state the main theorem of the section, about the linear fluctuations of the length 
of the LCS in the 3-multinomial block model: 

Theorem 4.1 Assume that the block-transformation satisfies Al. Then there exists an 
universal constant b > and n 6 < oo so that for every n > n e , it holds 

Var[L(Z)] > b ■ n. (4.20) 

Proof. Let us first check the assumptions of Theorem 12.21 Al holds by hypothesis; 
our block-transformation is such that A2 holds with A = 1 (as discussed above). Let 
now 77-4 be as in Lemma 14.51 Recall that n < ni < n<i < n 3 < n 4 < oo. Hence, 
for any n > n 4 , (14. 9p . ( I4.14p and (I2.17P hold. Moreover, from Lemma 14.14 h holds 
P(U EU n ,V G V n ) > 0.9. The condition gSJ states that for every (u, v) G S n f]{U n x V„}, 
we have that bi-i(u, v) > 1 and bi+i(u, v) > 1. Hence, the block-transformation is possible, 
and Lemma 14.71 now establishes A3. Therefore, for every n > n 4 , the assumptions of 
Theorem 12.21 are fulfilled and, therefore, there exists n§ > n 4 so that for for every n > n$, 
the assumptions of Theorem 12 . 1 1 hold with 5 = ^. 

We now apply Theorem 12 .![ As just showed, the assumption 1) holds for any n > n 5 ; as 
explained at the beginning of Subsection 14. 2[ the assumption 3) holds with k = 4. By 
(I4.14p . i\) n = dn. By Theorem 12 .![ thus, the lower bound (I4.20p exists with 
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A Appendix 

Proposition A.l Given e > there exist a constant c'(e) not depending on n but on e 
and no(e) < oo such that for every n > uq 



P 



n 



<c' 



B l -q 2 ] 



n 



<c' 



+i 



n 



< c' > 1 



(A.l) 
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Proof. It suffices to show that for every e > there exists ji(e) > i — 1, 2, 3 so that 

P(\^^l\<^)>l-e, (A.2) 



for i = 1, 2, 3 and l x := I - 1, Z 2 := Z and l 3 := I + 1. From (IO]l the bound (TAljl trivially 
follows by taking 

c'(e) := max{7i(-),7 2 (-),7 3 (-)}. 

Even more, we shall only show the existence of one-sided bound for 5;: for every e, there 
exists 7(e) so that 

'Hs^^y* 1 -'- (A - 3) 

The other side follows from the same arguments. Since in this proof, we consider q 2 , only, 
let for that proof q := q 2 . Let a, /3, 7 be positive real numbers and m G N. We define the 
random variables & and events A(a, m), P(/3, n) and (7(7, n) as following: 

, . = r 1 if wi = / 

* \ otherwise 
A(a,m) := {£1 H \- £ m < qm + ay/m] 

B(P,n) := jsn+A + A+i < ^ + 

C( T) n) := |s,<3^ + T Vn|. 

Note that for any m, the event B>i-\ + 5/ + P? + i > m implies that in the sequence X, 
there are more than m blocks. That, in turn, means that the m first blocks cover less 
than n bits of X\, X 2 , ... or, equivalently, W\ + ■ • • + W m < n. Let 

m(n,/3) := - + f3y/n. 
A* 

Note that - - u < 0. Thus 

m • 

P(B c (P,n)) < P{W l + --- + W m <n) 

^(W x + --- + W m n 

P A* < A* 

m 



m 




m f n 




8 Vm 




1 


8 [ 1- 





(by (J23D with P(|Wi - /i| < 2) = 1) < exp 

- -^ir^ ■ 

Now, for any e > 0, we can find (5 Q = /3(e) so big that exp[— ^J 2 -] < § . An then, one can 
take n (e) so big that ^= < 1. Hence, for any n > n , 

P(P c (n,/3 D ))<|. (A.5) 
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Let 



Then by (12.11) . for any m 



a :-- 



2g(l - q) 



P(A%a , m)) = P (' & + •;_• + _ q > ) < 1 . 



m / 2 



(A.6) 



Finally, if we define 



then we have 



700, ao) := q/3 + ao\ - + & 



Bi_i + Bi + Bi_i <m((3,n) J /i 

Therefore 

%m)n%n) CC( 7 ,n). (A.7) 

Now take n > no, m := m(/3 ,n) and 70 := 7(A), «o) and use ( 1A.6I) . ( 1A.4I) and ( 1A.7I) to 

get 

P I ^3 > 7ol = P(C c (7o, n)) < P(A c (a , m )) + P(P c (/3 , n)) < e. (A.8) 



Proof of Lemma 14. 1L By Proposition IA.ll for any e > there exist d = c'(e) > and 
no(e) < 00 such that: 



Bt-i + Bi + B l+1 - 2 



> 3c' P 



P 



Bi - - B l+1 - ^{q 2 -qi- q 3 ) 



< 



i=l 

> 3c'^j < P 

< 



Bi, - qt 



B1+1 -«3~ 



>3c' 



In 



> d < 3e 



A -ft* 



-Bi+i - 9377 



> 3c' 



/?? 



> d < 3e 



for any n > no, where as before Zi = / — 1, 12 = I and 1% — I + 1. Then, it directly follows: 

P(rj^Wf>^Vf) >6e 
from where the proof is completed by taking c := 3c' and e = 1/60. 
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Proof of Lemma 14.21 Let n > n . Take (u,v) G S n fl (U n x V„) (By lemma |4TT| the 
set <S n PI (U n x V n ) is not empty). In particular, 

Tl Ti 

t=- + a 1 , u = -(q 2 - q 1 - q 3 ) + a 2 , r G {1, + 1}, 

for o"i,o"2 G [— cyn, Cy/n]. Let us start by showing that bi t (u,t,r) > 1. Recall fi = 
I + Q3 — <?i ■ From (I4.4f) and 1 < r < / + 1 we get 

n 2crx/ + (cri - cr 2 ) + 2r 

bi-i(u,t,r) = q x - + 

fi 4 

n c(l + l) r- 

fi 2 

provided n > n lt i(c). Also, by using (I4.4p and 1 < r < / + 1 we get 

bi(u,t,r) = q 2 - H — > q 2 cy/n > 1 

fi 2 fi 



provided n > ni )2 (c). Finally, also by using (14.41) and 1 < r < / + 1 we get 

n 2o\l + (o 2 — o\) + 2r 



bi +1 (u,t,r) 



fi 4 

n c(l + l) r- 1-2 

> Qs ~Vn >1 

~ fi 2 V 2 ~ 



provided n > 721,3(0). So, for having simultaneously the three lower bounds we need to 
take n > n\ := maxjni^, ni j2 , ^1,3}- 

For the absolute value bounds, we proceed in the same way: 



n 



bi-x(u,t,r) - qi- 



2a x l + (ai - cr 2 ) + 2r 



n 



k(u,t,r) - q 2 - 



o\ + o 2 



n 



k +1 (u,t,r) - q 3 - 



2a x l + (a 2 - o-i) + 2r 



< °^ + 2) v^, for n > (/ - 2) 2 /c 2 



< cy 7 ^, for n > 1 
c(Z + l 



< 



Vn, for n > (/ - 2) 2 /c 2 . 



Thus, all the above three upper bounds hold for n\ := max{l, (I— 2)/c}. In order to obtain 
(14. 9j) . it is enough to take n\ = max{n^,n 2 } and the universal constant a := c(l + 2)/2 
which does not depend on (u,t,r). Without loss of generality, we can take ri\ > n Q . 



Proof of Lemma 14.31 Let n > n\ and consider (u, v) G S n r\ (U n x V n ). Let 

(bi^x(u,v),bi(u,v),bi +1 (u,v)) 
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be the solution of (14.41) . We have already seen that 

bi-t(u + 4:,v) = bi-!(u,v)-l 

kiu + ^v) = b l (u,v) + 2 
bi +1 (u + A,v) = bi + i(u,v)-l. 

Therefore, by using these relations and (14.81) (formula of the multinomial distribution for 

(U,T,R)) we get: 



P{U {v) = u + 4\U {v) eU n ) 
P(U {v) = u\U (v) e Un) 



bi- 1 (u,v)bi +1 (u,v) 

(bi(u,v) + l)(bi(u,v)+2) qi q 3 



(A.9) 



By using Lemma [4. 2[ there exist an universal constant a > such that: 



(bi(u,v) + l)(bi(u,v) + 2) qi q 3 

bi-i(u,v)bi + i(u,v) q| 
(&,(«,«) + l)(6,(w,«) + 2) ' qiq 3 



> 



< 



4 



1 + ^277 + a^E) ( 2 + Q27T + a^/n) 



(A.10) 



qiy + aM (q^+a^ri 



a.\ n 



A 



(A.ll) 



for every n> n\. Recall the inequalities (I4.12p . Let us start by looking at (lA.lOj) : 



1 + q 2 j; + a^Ti) (2 + q 2 ^j + a^E) ?1<& 



4 



> 



1 



v 7 " 



cxp 



1 + 



In 1 



fxa/qi 



-2 In 1 + 



2fia/q 2 



+ ln 1 



I from (I4.12p for every n > max{ 



u 2 a 2 ri 2 a 2 -i \ 

> exp 



wo / 3 8 3 
- + - + - 



1\fn \q\ q 2 93 



ua ( 3 8 3 
2 \q\ q 2 <?3 



— . (A.12) 



n 



Next, let us fix an arbitrary e > 0. Then, there exists n 2 ,i < 00 such that the rest of the 
Taylor's expansion of the function f(x) = e~ x at £ := ^| ^ + ^ + satisfies: 



R(0 :-- 



no 



t2 |U 2 a 2 / 1 



?? 



6 1 
— + — + — 

<?1 92 <?3 



exp 



1 6 1 
- + — + - 



n \Qi <?2 <?3 



< e (A.13) 
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for every n > n%^. Note that n.2,1 does not depend on (u, v) but only on known fixed 
constants. Now, let us look at ( lA.llj) : 



1 + 9277 - a^i) (2 + 9277 - ay/E 



9l93 



1 + 



/W<7i 



(from (|4.12j) for every n > ^ L 



< 

= exp 

-2 Id I i 

< exp 

< 1 + \ia 



1 + 



/Wg3 



^ _ 2fj,a/q 2 

in(1 + W9iV ln f 1+ W93 



in 
2fia/q 2 



n 



n 



ua ( 1 6 1 

n \qi 92 93 

9i 92 93/ Vn 



from (TQ3l) < 1 + ua ( — + — + — ) -|= + e (A. 

V9i 92 93/ V n 



14) 



for every n > 712,2 := max{n2,i, 16 ^ a }■ Finally, from ( IA.12j) and ( IA.14j) we have: 



P(U {v) = u + A\U (v) eU n ) 



i-^i- + - + -i-^< 



,1 6 1 

< 1+fia — + — + — 

Ql <?2 ?3 



7? 



for any arbitrary e > . ^From this last inequality, we can find a constant K > not 
depending on n neither on (w, v) and n 2 bigger than n 2j 2 and n\ such that ( 14. 13ft holds 
for every n > n 2 . 



Proof of Lemma 14.51 The proof is based on the Corollary 12.21 Define 

p ■= (a + c)V^, (A.15) 



where a is as in ( 14, 9p and choose ra 4 > n 3 so big that simultaneously — — C\/n > m ((3) 
and y/nl > 2c//. Here m (f3) is as in Corollary 12.21 From these inequalities, it follows 
that whenever n > n 4 , then 

Cy/n > max { — 77, m }. (A. 16) 

/i 1 2/i J 

Take now 77 > n 4 and (w, t, r) G «S n D (U n x V n ). By (IA. 16j) . 

77 

t> c v / n>m D (/3), 2//f>n. (A.17) 

Use now ( 14. 9 h and the definition of V n to see that for every i = 1, 2, 3, 

IK - tqi\ < \b h - <7i(-)| + _ *l < + qiCy/n < (a + c)y/n < (a + c)^/2flVt, 
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where the last inequality follows from flA.17j) and h\ i = 6^(w,t,r). Apply (12. 5p with /3 as 
in (IA.15D. m = t, pi = q x , p 2 = q 2 , P3 = <?3 and % = b h , j = b h . Then by (JHHD 



P{U = u,T = t,R = r) 



t 

K K K 



where the last inequality comes from the fact that p(r) > q^. Thus Lemma [4.51 is proven 
with 
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